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Abstract 

The mincut graph bisection problem involves partitioning the n vertices of a graph 
into disjoint subsets, each containing exactly n/2 vertices, while minimizing the number 
of "cut" edges with an endpoint in each subset. When considered over sparse random 
graphs, the phase structure of the graph bisection problem displays certain familiar 
properties, but also some surprises. It is known that when the mean degree is below 
the critical value of 2 log 2, the cutsize is zero with high probability. We study how 
the minimum cutsize increases with mean degree above this critical threshold, find- 
ing a new analytical upper bound that improves considerably upon previous bounds. 
Combined with recent results on expander graphs, our bound suggests the unusual sce- 
nario that random graph bisection is replica symmetric up to and beyond the critical 
threshold, with a replica symmetry breaking transition possibly taking place above the 
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threshold. An intriguing algorithmic consequence is that although the problem is NP- 
hard, we can find near-optimal cutsizes (whose ratio to the optimal value approaches 
1 asymptotically) in polynomial time for typical instances near the phase transition. 

1 Introduction 

The graph bisection problem arises in a wide range of contexts, ranging from VLSI de- 
sign [Tj to computer vision [9l[23] to protecting networks from attack [18J. The basic mincut 
graph bisection (or graph bipartitioning) problem is defined as follows. Given an undirected, 
unweighted graph, partition its n nodes into two disjoint subsets of equal size, while mini- 
mizing the number w of "cut" edges with an endpoint in each subset. This is an NP-hard 
optimization problem [Ti] . 

From a statistical physics perspective, graph bisection is also equivalent to finding the 
ground state of a ferromagnet constrained to have zero magnetization [221 [23, [211 [22] • 
Over the past decade, the study of combinatorial optimization problems within the physics 
community as well as the study of statistical physics approaches within the computer science 
community has led to remarkable successes [32] . By considering optimization problems over 
appropriately parametrized ensembles of random instances, physicists have developed meth- 
ods leading both to analytical predictions of global optima and to algorithmic approaches 
for finding these. One particularly rich source of insight has been the investigation of phase 
transition behavior: for many combinatorial optimization problems, both the nature of the 
solution space and the average-case complexity of algorithms solving the problem are closely 
related to an underlying phase structure. 

In this paper, we consider graph bisection over Erdos-Renyi (Bernoulli) random graphs [12] 
generated from the Qnp ensemble: given n nodes, place an edge between each of the (2) pairs 
of nodes independently with probability p. We are interested in the regime of sparse random 
graphs, where p scales as l/n and so a vertex's mean degree p{n — 1) is some finite a. On 
such graphs, it is known that a phase transition occurs at ac = 2 log 2 [161 EZl [211 EHl [26] . 
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For a < ac, typical graphs can be bisected without cutting any edges: the bisection width 
w is zero. For a > ac, the graph typically contains a connected component (giant compo- 
nent) of size greater than n/2 and so a bisection requires cutting edges, giving w > 0. In 
other combinatorial optimization problems such as graph coloring or satisfiability, a similar 
sharp threshold coincides with a rapid escalation of average-case computational complex- 
ity [m [301 [32]. 

Additionally, for many of these other problems, at some ad < etc the solution space 
undergoes a structural transition that corresponds to replica symmetry breaking [H [29] . 
Define two optimal solutions to be adjacent if the Hamming distance between them is small: 
an infinitesimal fraction of the variables differ in value. Then below a^,, any two solutions 
are connected by a path of adjacent solutions. The solution space may be thought of as a 
single "cluster". This is a replica symmetric (RS) phase. But above a^, the cluster fragments 
into multiple non-adjacent clusters, differing by an extensive number of variables. This is a 
replica symmetry breaking (RSB) phase. In cases where < ac, such as 3-COL and 3-SAT, 
this structural transition may contribute to the rise in algorithmic complexity as the main 
threshold ac is approached from below [28] . 

We find that the scenario is quite different for graph bisection. Rather than the RSB 
transition taking place below ac, our arguments suggest that it actually occurs somewhere 
above ac- Graph bisection is the first combinatorial optimization problem we are aware 
of that displays this behavior. Furthermore, in 1987, Liao proposed an analytical solution 
for bisection width [25] that assumes replica symmetry above ac, so our results help 
validate this prediction. Finally, the same arguments that indicate replica symmetry also 
suggest an algorithm for finding extremely close approximations to the optimal solution, for 
a G {ac,ad), in polynomial time. 

The rest of the paper is organized as follows. In Section [2] we study the bisection width 
(or "cutsize") w as a. function of mean degree a. We present numerical results and derive a 
new upper bound by analyzing a simple greedy heuristic. In Section [3] we study the solution 



3 



cluster structure, and argue that the problem is replica symmetric through and beyond the 
critical threshold Oc = 2 log 2. In Section H] we conclude with a discussion of the physical 
and algorithmic consequences of this unusual phase structure. 

2 Bisection Width 

Formally, the problem is as follows. Given an undirected, unweighted graph G = {V, E) 
where \V\ = n is even, a valid solution is a partition of the vertex set into two disjoint 
subsets Vi and V2 such that |Vi| = IV2I = n/2. An optimal solution is one that minimizes 
the bisection width w = \E{Vi, V2)|, i.e., the number of "cut" edges with an endpoint in each 
subset. We consider graphs G drawn uniformly at random from Q^p, and take our control 
parameter to be the graph's mean degree a = p{n — 1). 

The Qnp ensemble fi2\ [8] has been studied very extensively since its introduction by Erdos 
and Renyi in 1959, and many of its structural properties are known. These have a crucial 
effect on the bisection width. 

• For a < 1, the graph consists only of small components, the largest being asymptoti- 
cally of size O(logn). The fraction of monomers, or isolated vertices, is asymptotically 
almost surely (a.a.s.) itself at least e~^. Consequently, with high probability a perfect 
bisection {w = 0) exists. 

• For a > 1, there is a unique giant component containing 0{n) vertices. The second- 
largest component is asymptotically of size O(logn). As long as a is below the threshold 
ftc = 2 log 2, the giant component a.a.s. contains a fraction g < 1/2 of the vertices, and 
the fraction of monomers is still above 1/4. This allows one to prove flGl [26] that with 
high probability, w = for all a < 2 log 2. 

• For a > 2 log 2, g > 1/2, and so bisecting the graph requires cutting some of its edges. 
It is known [26] that w is extensive (scales as n) in this regime, as long as a is finite. 
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Figure 1: Presumed optimal bisection width, found by extremal optimization on random 
graphs for n = 2000. Line shows fit to 7(0 — Oc)^, with 7 f» 0.0783 and /3 ~ 1.46. See 
also [S]. 

How does the bisection width constant w/n behave for a > Figured] shows simulation 
results using the extremal optimization [6] heuristic to find the presumed optimal bisection. 
These suggest that the bisection width undergoes a continuous transition at Oc, and that 
its first derivative is continuous as well. However, neither of these properties has yet been 
shown analytically. We will do so now, by deriving a straightforward upper bound on w. We 
give the main elements of the derivation here; a more rigorous mathematical treatment will 
be provided elsewhere [20] . 

We obtain the upper bound by "stripping" trees off of the giant component. For random 
graphs with mean degree a, it is known that the fraction g of nodes that are in the giant 
component is given a.a.s. by [12] 



and the fraction h of nodes that are in trees within the giant component (the mantle [33]) is 

cL. cL. S. 
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Figure 2: Schematic of giant component consisting of a mantle of trees and a remaining 
2- core. Greedy upper bound algorithm cuts trees of decreasing size. Numbers show order of 
cuts, depicted by dashed hues. 

Now imagine a greedy process (Figure [2]) of removing trees, starting from the largest one 
and decreasing in size, until the giant component has been pruned to size < n/2. In order 
to know how many trees need to be removed, consider the distribution of tree sizes, P{t). A 
fortunate consequence of the probabilistic independence in Qnp is that P(t) is simply given 
by the number of ways of constructing a tree of size t from q roots and r other nodes, where 
q = {g — b + a{n))n is the number of nodes in the 2-core of the giant component (all nodes 
not in the mantle) and r = (6 + T{n))n is the number of nodes in the mantle, with cr(n) and 
r(n) being random functions that go a.a.s. to zero [211 12] • A basic combinatorial argument 
then gives 




(3) 



and for large n, letting p = b/g, 
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for t > 1. 
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Let z be the total number of trees. Then, given the total number of nodes in all 

trees is Xlt^i tP{t)z. As this is simply the size of the mantle, it holds a.a.s. that 



(5) 



Let us evaluate this expression. From normalization of P{t), 



and differentiating with respect to p, 



i_i^f M!^.^. (7) 



The left-hand expression is simply so Eq. ([5]) becomes 



^ Ki-Pf- (8) 



We now wish to set to so that the number of nodes living on trees of size > to is just 
barely enough to cover the "excess" of the giant component, 6 = g — 1/2. In that case, to 
satisfies the property 

oo 
t=to + l 

i=io ■ 
oo 

, Z 



> pe-''-V2(to + l)P(t) (12) 
n ^-^ 

t=to 

Given to satisfying both this and the corresponding lower bound on S, bisecting the graph 



is simply a matter of cutting each tree of size > to- Any resulting imbalance in the partitions 
(from "overstripping" ) can easily be corrected by transferring enough monomers: there is an 
extensive number of these, whereas the largest trees are only of size O(logn). Thus, 

oo 

- < -E^W (13) 
n n ^ — ^ 

t=to 

< Y'K (14) 
2(io + l) ^ ' 

from Eq. (fT2|) . In order for the tree-stripping procedure to work, the mantle must contain 
enough vertices to strip, and so we know that b > 6. One can solve Eqs. ([T]) and ([2]) numeri- 
cally to find that this imposes the condition g < cq with constant cq ~ 0.813, corresponding 
to a < 2.06. For ac < a < 2.06, it is straightforward to confirm that 6{pe^P)^^ < a — ac, 
giving 

- < —, -:■ (15) 

n 2(to + l) 

Finally, we bound to itself. A version of Stirling's formula gives the inequality t\ < 
t*e-*y27rt/(l - l/2t), which we apply in Eq. ([10]): 

^ > £ f ^Mfll^ (16) 

t=to + l ^ 

> Kl - Pf Vijl - l/2t)p^e-^* 



V27rp 
b{l - pf {pe'-PY°+' 



V27ip ^4^, 



^ . 1 ■ (19) 

1-pe^-P ^ ' 

We have just seen that a — ac> 6{pe~P)~^ in the regime of interest, so 

a-ac> ^ Kl-P)' (pe^^O*"^'- (20) 
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Then letting e = a — 



+ log(l/pe-P) • 

It can be verified that the second term in the numerator is monotone increasing in g and is 
thus bounded below by its value at (7 = 1/2 (the smallest possible giant component size for 
C( > ttc); which corresponds to p = log 2. This gives 

^o + l>^^#f^ (22) 
log(l/pei-P) ^ ^ 

where 

c, = - log ^ 1.25. (23) 

v^log2(l -elog2/2) 

Finally, the denominator in Eq. (122|) is also monotone increasing in g, and thus bounded 

above by its value where the mantle is exhausted at (7 = cq ~ 0.813, which corresponds to 

p = 1 — l/2co. Therefore, 

J, + 1 > !2I(V!)^ (24) 

C2 



C2 = — ^ -logfl - — I ^ 0.339. (25) 



where 

2co V 2co 
Inserting this to bound into Eq. f|T5|) gives the bisection width bound 

n 2 log(l/e) -ci' ^ ' 

We stress several points regarding this bound. First of all, the result closes the gap 
between upper and lower bounds on w/n ai a = Oc- The unsurprising fact that both 
bounds go to zero and the bisection width is continuous at the transition is already apparent 
from Eq. (IT^ . The less obvious property that its first derivative (with respect to a) is also 
continuous at the transition follows from Eq. ( l26l) . Second of all, the denominator of Eq. (l26l) 
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is only positive for < e < l/e'^^. Numerically, this is roughly 1.39 < a < 1.67, which covers 
a significant fraction of the regime of interest 1.39 < a < 2.06. Finally, from Eq. ( 12^ it is 
clear that to diverges at the transition, and for sufficiently small e it satisfies the even simpler 
bound to > log(l/e)- These facts will be significant for the arguments in the next section. 

3 Solution Cluster Structure 

We now consider the geometry of the solution space for random graph bisection. For a 
graph G G Qnp-, given two distinct optimal bisections A and B, define the relative Hamming 
distance yu(v4, B) to be the fraction of nodes that are in one of the subsets (Vi or V2) in 
solution A but in the opposite subset in solution B. Since there is a global symmetry under 
exchange of Vi and V2, we are interested in d{A, B) = min(/i(y4, B),l — fi{A, B)). 

Define A and B to be r-adjacent if d{A, B) < r. For some small r, any two solutions that 
are connected by a chain of r-adjacent solutions are said to be in the same solution cluster. 
Asymptotically, for a given mean degree a, we would like to know what the limiting cluster 
structure is when r becomes an arbitrarily small constant. One scenario that is common 
for sufficiently underconstrained combinatorial optimization problems is replica symmetry: 
for any finite r > 0, all solutions are with high probability within a single cluster. Another 
scenario, common for more constrained instances, is one-step replica symmetry breaking: for 
sufficiently small r, there is a large number of disconnected clusters, separated from each 
other by at least some finite relative Hamming distance. 

In what follows, if solutions constructed in a particular way are with high probability 
r-adjacent for any finite r > 0, we will refer to them as being simply adjacent. 

3.1 a<21og2 

Consider graphs where optimal bisections require no cuts in the giant component. We first 
argue that we can almost surely create such a bisection by populating one of the two subsets 
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only with the largest component and with monomers and dimers (components of size 2). We 
then use arguments in to show that having created a bisection in this way, it must be 
in the same cluster as all optimal bisections. 

As discussed in Section [21 in Qnp graphs for large n, a fraction e~" of nodes are monomers. 
What about dimers? For a pair of nodes to be a dimer, it must contain an edge (probability 
p ^ a/n), and all other pairs involving one of its two endpoints (2n — 4 of these) must not 
contain an edge (probability l—p): asymptotically, this occurs with probability ae~'^°'/n. So 
the expected number of dimer pairs is ~ ae~'^°'n/2, and a.a.s. the fraction of nodes in dimers 
is ae~^". Thus, a.a.s. the fraction of nodes in monomers or dimers is s = e~"(l + ae~"). 

For a < 1, s > 1/2. For 1 < a < 2 log 2, s falls below 1/2, but one may verify that 
g + s > 1/2 where g is the size of the giant component given in Eq. (JT]). Therefore, as long 
as a < 2 log 2, we a.a.s. surely have enough monomers and dimers to fill up a subset that 
otherwise contains only the largest component (or one of them, if not unique). Call such 
optimal solutions clean bisections. 

Next, we show that any two distinct clean bisections A and B (and hence all clean 
bisections) must be in the same solution cluster. If the largest component is unique, A and 
B can differ at most by the choice of which monomers and dimers are selected to be in its 
subset. If the largest component is not unique, it cannot be a giant component, so two clean 
bisections can then differ at most by: 1) a component consisting of a vanishing fraction of 
nodes, and 2) monomers and dimers. In both cases, we can take any one of the differing 
components in A and swap it with monomers and dimers from the other subset, producing 
an adjacent clean bisection. By iterating the process, we will necessarily arrive at B through 
a chain of adjacent clean bisections, since we have already shown that there is a sufficient 
supply of monomers and dimers for this purpose. 

Finally, it is straightforward to see that any optimal solution must be in the same cluster 
as some clean bisection. This holds for almost the same reason as in the previous paragraph. 
If the subset containing the largest component also contains any components other than 
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monomers and dimers, each of these asymptotically consists of a vanishing fraction of nodes. 
They may be iteratively swapped with monomers and dimers to produce a chain of adjacent 
solutions, leading to a clean bisection. 

Since all clean bisections are in the same cluster, and any solution is in the same cluster 
as some clean bisection, all solutions must lie within a single cluster for a < 2 log 2. This is 
a replica symmetric phase. 



When the bisection width is nonzero, the situation is more complicated. Now we must 
consider the different ways in which optimal bisections can cut the giant component. We will 
show that up to some ad > ac, optimal bisections avoid a central core of the giant component 
and are restricted to a well-defined periphery. We then give arguments suggesting that for 
those graphs, all optimal bisections are in a single cluster. 

3.2.1 The expander core 

Call the giant component S, and imagine that for any a > 2 log 2, some identifiable part 
of it, X C 5*, is an expander. This means that cutting off any piece Xq from X requires 
(asymptotically) at least some finite expansion 



i.e., the number of cut edges per vertex in the broken piece. The expander X would then 
be precisely the central core that optimal bisections avoid, for e = a — ac less than some 
constant. The reason is as follows. We have seen in Section [2] that when cutting trees only, 
the smallest tree needed is of size 



3.2 a>21og2 
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Figure 3: Reciprocal of spectral gap. For mean degree a = 1.4 through a = 2.0, on graphs 
of size n = 1000 up to n = 16000 this quantity vanishes roughly as ~ 1/ logn. 

and so the largest expansion is 



For sufficiently small e, this must fall below the (finite) value of h for the expander. Thus, 
below some finite eo > 0, trimming the giant component's excess requires fewer cuts if these 
are performed exclusively within the mantle than if any part of the cuts are performed within 
the core. 

One might hope that the 2-core is an expander, in which case greedily trimming trees 
would in fact be the optimal way to bisect a graph. As it happens, this is almost certain 
not true. Consider the 2-core's spectral gap, the difference Ai — A2 between the two largest 
eigenvalues of its connectivity matrix. Via Cheeger's inequality ^Q\, this yields an upper 
bound on expansion. Figure [3] suggests that the spectral gap likely vanishes (albeit slowly) 
as n — » 00, implying that the 2-core's expansion vanishes, presumably due to the existence 
of cycles of length ~ logn. Certain "bottlenecks" in the 2-core could then provide more 
efficient cuts than trees in the mantle. 

But some further intuition comes from considering random cuts in the 2-core. Figure H] 



h < 



1 



(29) 
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Figure 4: Smallest expansion over a random sample of ways of slicing the 2-core into two 
pieces. Randomly chosen nodes are sequentially separated from the 2-core until half of the 
2-core has been cut. For each value of mean degree a, and for each size from n = 1000 to 
n = 32000, this random sequence is repeated 10 times on each of 100 graphs. Results show 
the lowest expansion value found over the course of all 10 sequences and over all 100 graphs. 
For increasing graph sizes, this value approaches 1. 

shows the minimum expansion from a number of ways of randomly slicing the 2-core into two 
pieces. These numerical results, which we originally reported in [17] , suggest that for large n 
the minimum expansion never falls below a constant. This means that any bottlenecks that 
do exist in the 2-core are not immediately apparent, and some significant part of the 2-core 
could in fact be an expander. 

Subsequent results by Benjamini et al. [2] have confirmed that this is indeed the case: 
for all a > 1, they give an explicit procedure for stripping the giant component down to an 
expander core. The giant component is a "decorated expander," with many small (O(logn) 
at the largest) "decorations" Di attached to the expander core X, as shown schematically in 
Figure [51 Therefore, by our previous argument, below some finite eg, the cuts in all optimal 
bisections are restricted to the decorations Di. 
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Figure 5: A different view of the giant component in Figure [2], shown here as an identifiable 
expander core X within the 2-core. The trees are now contained within numerous small 
decorations Di that are attached to the expander core. 

3.2.2 Consequences for solution structure 

We now use arguments related to the ones in Section [3A] to suggest that for e < eo (« < = 
2 log 2 + eo), all optimal solutions are contained within a single cluster. We first show how 
to construct a neat bisection. By neat, we mean that one of the two subsets contains only 
the expander core, some attached decorations, and monomers. We then present arguments 
suggesting that this bisection, even if not optimal, is adjacent to a neat optimal bisection 
that is in the same cluster as all optimal bisections, thus replica symmetry holds for all 
a < ad- 

The construction procedure is as follows. Consider the giant component S as being 
composed of the expander core X and decorations Di as in Figure [5l These are disjoint 
subsets, with some edges connecting the to X but none connecting the to each other |2j. 
For each decoration Dj, find the nonempty subset D] C that minimizes the expansion 




(30) 
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resulting from cutting off the piece Dj. The value of hj may be interpreted as a "cost" per 
node that we attribute to each node in Dj. Now repeat this procedure on Di \ Dj, resulting 
in a new piece Df, except that this time we find the Df minimizing the marginal cost of 
cutting off Df given that Dj is already cut off, i.e., 

^2 ^ \E{S \ {D} U A^), D} U DD\ - \E{S \ Dl Dj) \ _ ^^^^ 
Recursively perform the procedure, letting 



until decoration Di has been completely partitioned into disjoint subsets Dl,...,D^. It 
is straightforward to confirm that these, by construction, are now ordered in increasing 
marginal cost h\ < ■ ■ ■ < h^, where the total cost \E{S, Di) \ of cutting the entire decoration 
is simply equal to the sum of the marginal costs over all nodes. 



m 



\E{S,D^\ = J2K\Dll (33) 
i=i 

Note that is the minimum possible cost of cutting a node in Dj. Since cuts in optimal 
solutions are restricted to decorations, generating a neat optimal bisection requires an aggre- 
gated ranking of all subsets of all decorations, Dj, according to their costs hj. Analogously 
to the greedy algorithm in Section [2], where we stripped trees in increasing order of expan- 
sion, we now strip the pieces of the decorations until the giant component's excess has been 
removed. As with the greedy algorithm, this can result in overstripping. There may also be 
numerous pieces with an equal ranking, leaving the choice of which ones to pick in order to 
overstrip as little as possible. But due to the maximal size of the decorations |2j, the largest 
of these pieces is O(logn). So regardless of which we choose, as with tree-stripping, we can 
compensate for the imbalance by transferring some of the extensive supply of monomers. 
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The result is a neat bisection, albeit not necessarily an optimal one. 

In principle, by perfectly selecting the final pieces of equal rank and thereby minimizing 
the cost of overstripping, we can even form any neat optimal bisection in this way. We discuss 
this in the next section. But no matter how we select the final pieces, it seems highly likely 
that our construction gives a solution differing from some optimal one by at most the number 
of variables contained in a finite number of these pieces, which is O(logn). Furthermore, it 
is not difficult to see that all neat (not necessarily optimal) bisections that use the identical 
choice of compensating monomers lie within a single cluster. It then follows that among these 
neat bisections, all those that are optimal are in a single cluster as well. And every neat 
optimal bisection must then be in the same cluster, since successively swapping monomers 
gives a chain of adjacent solutions connecting them. 

Finally, for similar reasons as in Section [21 any optimal bisection must be in the same 
cluster as some neat optimal bisection. If the subset containing the expander core also 
contains components other than monomers, each one can have size at most 0(log?T,) and can 
be swapped with monomers from the other subset to produce an adjacent optimal bisection. 
Iterating this process leads to a neat optimal bisection. Consequently, the argument implies 
that for 2 log 2 < a < ad, all optimal bisections lie within a single cluster, and the replica 
symmetric phase extends up to ad > ac- 

Above ad, the solution structure is less clear. When a is large enough, optimal cuts are 
no longer necessarily restricted to the decorations: the expander core likely needs to be cut 
as well. In that case, the sizes of the cut pieces could well be extensive. One could easily 
imagine extensive gaps in the solution structure, leading to multiple clusters and replica 
symmetry breaking. Although we have no direct evidence of it, we speculate that this is the 
case for a greater than some finite value, suggesting an RSB phase boundary at or above 
ad, i.e., above the critical threshold ac- 
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4 Discussion and Conclusions 



Our study points to an unusual kind of phase structure for random graph bisection. The 
typical scenario, in random combinatorial optimization problems displaying a sharp thresh- 
old in the objective function's behavior, is that an RSB transition takes place at or below the 
main critical threshold: ad < oic- By contrast, graph bisection appears to be replica sym- 
metric up to and beyond the threshold, with ad > ac- We note that this lends support to an 
analytical bisection width prediction proposed by Liao in 1987 pH[25] . Let us now comment 
on several further aspects of ad, as well as on the intriguing algorithmic consequences of this 
phase picture. 

Although ad is strictly greater than a^, it may not be much greater. As mentioned at the 
end of Section [21 our bisection width bound is only valid for a < 1.67. That bound is needed 
for being able to restrict optimal cuts to the giant component's decorations. We do not 
presently have any other quantitative bound or estimate on the specific value of ad- However, 
some further qualitative observations come from examining the procedure for obtaining the 
expander core. Roughly speaking, Benjamini et al. [2] strip the giant component down to 
what we call a 2.k-core, namely a connected subgraph of the giant component where all 
vertices have degree at least 2 (no trees) and where the largest chain is of length k. The 
2.fc-core is an expander as long as is a sufficiently large finite constant. The construction 
of [2] does not explicitly state how large k must be, or what the minimum expansion value 
h is for the 2.A;-core at a given mean degree a. But decreasing values of k clearly result in 
smaller 2.A;-cores (Figure [6]), and presumably in larger expansion as more long chains are 
removed. This would lead to a higher ad- Thus, by picking the smallest possible k (for every 
a) that still yields an expander core, the resulting expansion gives, via Eq. fl29|) . the largest 
possible ad for which we can be sure that graph bisection is replica symmetric at all a < ad- 

Algorithmically, the striking consequence of our study is that we may obtain near-optimal 
bisections and perhaps even optimal ones, for all a < ad-, in polynomial time. By near- 
optimal, we mean that the ratio of its bisection width to the optimal value approaches 1 
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Figure 6: Fraction of nodes in the 2.k-coTe, k = A through k = 7, ai a = 1.4. Numerical 
results are averaged over 200 instances, for increasing values of n. 

asymptotically. We do this by stripping the giant component down to the expander core 
(using any sufficiently large k), and then following the construction in Section [3.2.21 The 
stripping process and decoration identification [2j takes polynomial time. The next subtlety 
is breaking a decoration Di into the correct disjoint pieces Dj : an exhaustive approach to 
finding Dj involves comparing all of the 2l^'l ways of partitioning into two subsets. But 
since no decoration Di has more than 0(log?T,) nodes, this can be done in ri'^^^^ steps, and 
then all pieces can be found and ranked {h] < ■ • • < h^) in n*^^^) steps. For the final 
pieces of equal rank, even if we simply pick them in arbitrary order we will overstrip by at 
most O(logn). Thus, in polynomial time we reach a solution whose (extensive) bisection 
width is within O(logn) of the optimal one. 

Of course, the 0(1) constant in the exponent may be quite large, which could make this 
result largely a theoretical one. But it is worth noting that we have some freedom to decrease 
the size of the decorations, and thus lower the constant. If, say, instead of using the smallest 
possible k that gives an expander core, we use the largest possible k that gives expansion 
greater than our tree-cutting bound (Eq. (!29|) ). this will strip the minimum possible from the 
giant component. In this way, we may be able to minimize the size of the largest decoration. 

It would be particularly remarkable if some version of our construction could give a true 
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optimal bisection, with high probabihty, in polynomial time. For this, we need to pick the 
final pieces of equal rank so as to minimize overstripping. The difficulty is that the number 
of such pieces is likely to be infinite: an extensive number of nodes is contained collectively 
within the decorations, while there are at most 0(log^?T,) possible marginal costs (O(logn) 
possible integers for the numerator and O(logn) possible integers for the denominator). On 
the other hand, it could well be that of the infinite number of final pieces, a finite fraction 
of them have any given finite size |-D^|. In that case, a simple greedy procedure will find a 
collection of pieces that minimizes overstripping. Having minimized overstripping, one still 
needs to check whether it is possible to lower the bisection width further by restoring a small 
fragment of a cut piece. If the amount we overstrip is O(logn), then exhaustively checking 
this could require ~ dogn) steps. But more likely, one can strengthen our analysis to show 
that the final pieces are in fact of size 0(1), in which case the last operation could also be 
accomplished in polynomial time. Clearly, this intuition is far from a proof of any sort. But 
if indeed the optimal bisection can be found in poljTiomial time for a < a^, we would have 
a very unusual example of an NP-hard optimization problem whose typical-case complexity 
is polynomial at and above the main threshold ac- 

We offer two ffnal remarks about other versions of the problem. First, our approach 
suggests that the peculiarities of random graph bisection's phase structure are closely related 
to the problem's global constraint — the fact that a subset cannot be larger than n/2 — 
combined with the specific topology of the giant component in Qnp- This means that the 
problem of balanced multiway graph partitioning on Qnp graphs should have very similar 
characteristics, and can likely be analyzed through similar means. Second, the situation is 
very different if instead of Qnp we use a graph ensemble that has local geometric structure. An 
example is the random geometric graph ensemble Qnr [31j, where nodes are placed uniformly 
at random in a unit square and pairs are connected by an edge if separated by distance < r. 
In this case, the giant component is very unlikely to have any expander core due to its large 
diameter. Rather than having a tree-like periphery, it contains numerous local cliques, and 
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minimizing the bisection width involves locating the bottlenecks in between those cliques. 
But at the same time, Qnr graphs have useful lattice-like properties [15], with the optimal 
bisection width scaling [3ll [7j as w ~ y/n, precisely as one finds on a regular 2-dimensional 
lattice structure. We expect that the structure of bisections is closely related to the geometry 
of the giant component in bond percolation on a lattice, which has been studied [3]. Adapting 
that analysis could lead to nontrivial bounds on w and rapid algorithms finding very close 
approximations to the optimal cut in Qnr- 

5 Acknowledgments 

This work was supported by the U.S. Department of Energy at Los Alamos National Labo- 
ratory under contract DE-AC52-06NA25396 through the Laboratory Directed Research and 
Development Program, a Marie Curie International Reintegration Grant within the 6th Eu- 
ropean Community Framework Programme, a PN Il/Parteneriate grant from the Romanian 
CNCSIS, and a National Science Foundation grant DMR-0312510. 

References 

[1] C. J. Alpert and A. B. Kahng. Recent directions in netlist partitioning: a survey. 
Integration, the VLSI Journal, 19:1-81, 1995. 

[2] I. Benjamini, G. Kozma, and N. Wormald. The mixing time of the giant component of 
a random graph. arXiv:math. PR/0610459, 2006. 

[3] I. Benjamini and E. Mossel. On the mixing time of a simple random walk on the 
supercritical percolation cluster. Probability Theory and Related Fields, 125:408-420, 
2003. 



21 



[4] G. Biroli, R. Monasson, and M. Weigt. A variational description of the ground state 
structure in random satisfiability problems. European Physical Journal B, 14:551-568, 
2000. 

[5] S. Boettcher. Extremal optimization of graph partitioning at the percolation threshold. 
Journal of Physics A: Mathematical and General, 32:5201-5211, 1999. 

[6] S. Boettcher and A. G. Percus. Nature's way of optimizing. Artifical Intelligence, 
119:275-286, 2000. 

[7] S. Boettcher and A. G. Percus. Extremal optimization for graph partitioning. Physical 
Review E, 64:026114, 2001. 

[8] B. BoUobas. Random Graphs. Cambridge University Press, Cambridge, UK, 2001. 

[9] Y. Boykov, O. Veksler, and R. Zabih. Fast approximate energy minimization via graph 
cuts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23:1222-1239, 
2001. 

[10] J. Cheeger. A lower bound for the smallest eigenvalue of the Laplacian. In Problems in 
analysis (Papers dedicated to Salomon Bochner, 1969), pages 195-199. Princeton Univ. 
Press, Princeton, N. J., 1970. 

[11] P. Cheeseman, B. Kanefsky, and W. F. Taylor. Where the really hard problems are. 
In Proceedings of the 12th International Joint Conference on Artificial Intelligence (IJ- 
CAr91), pages 331-337, San Mateo, CA, 1991. Morgan Kaufmann. 

[12] P. Erdos and A. Renyi. On random graphs. Publ. Math. Inst. Hungar. Acad. Sci., 1959. 

[13] Y. Fu and P. Anderson. Application of statistical mechanics to np-complete prob- 
lems in combinatorial optimisation. Journal of Physics A: Mathematical and General, 
19(9):1605-1620, 1986. 



22 



[14] M. R. Garey and D. S. Johnson. Computers and Intractability. W.H. Freeman, New 
York, 1979. 

[15] A. Goel, S. Rai, and B. Krishnamachari. Monotone properties of random geometric 
graphs have sharp thresholds. Annals of Applied Probability, 15:2535-2552, 2005. 

[16] M. Goldberg and J. Lynch. Lower and upper bounds for the bisection width of a random 
graph. Congr Numer, 49, 1985. 

[17] B. Gongalves, G. Istrate, A. Percus, and S. Boettcher. The core peehng algorithm for 
graph bisection: an experimental evaluation. Technical Report LA-UR 06-6863, Los 
Alamos National Laboratory, 2006. 

[18] P. Holme, B. J. Kim, C. N. Yoon, and S. K. Han. Attack vulnerability of complex 
networks. Phys. Rev. E, 65:056109, 2002. 

[19] G. Istrate, S. Kasiviswanathan, and A. Percus. The cluster structure of minimum 
bisections of sparse random graphs. Technical Report LA-UR 06-6566, Los Alamos 
National Laboratory, 2006. 

[20] G. Istrate, S. Kasiviswanathan, and A. G. Percus. The bisection width and solution 
cluster structure of sparse random graphs. In preparation. 

[21] S. Janson, T. Luczak, and A. Rucihski. Random Graphs. John Wiley & Sons, Inc., New 
York, 2000. 

[22] I. Kanter and H. Sompolinsky. Mean-field theory of spin-glasses with finite coordination 
number. Physical Review Letters, 58(2):164-167, 1987. 

[23] V. Kolmogorov and R. Zabih. What energy functions can be minimized via graph cuts? 
IEEE Transactions on Pattern Analysis and Machine Intelligence, 26:147-169, 2004. 

[24] W. Liao. Graph bipartitioning problem. Physical Review Letters, 59(15) :1625-1628, 
1987. 

23 



[25] W. Liao. Replica-symmetric solution of the graph-bipartitioning problem. Physical 
Review A, 37(2):587-595, 1988. 

[26] M. J. Luczak and C. McDiarmid. Bisecting sparse random graphs. Random Struct. 
Algorithms, 18:31-38, 2001. 

[27] M. Mezard and G. Parisi. Mean-field theory of randomly frustrated systems with finite 
connectivity. Europhysics Letters, 3(10): 1067-1074, 1987. 

[28] M. Mezard, G. Parisi, and R. Zecchina. Analytic and algorithmic solutions of random 
satisfiability problems. Science, 297:812-815, 2002. 

[29] M. Mezard and R. Zecchina. Random K-satisfiability problem: from an analytical 
solution to an efficient algorithm. Physical Review E, 66:056126, 2002. 

[30] D. G. Mitchell, B. Selman, and H. J. Levesque. Hard and easy distributions for SAT 
problems. In P. Rosenbloom and P. Szolovits, editors. Proceedings of the Tenth Na- 
tional Conference on Artificial Intelligence, pages 459-465, Menlo Park, California, 
1992. AAAI Press. 

[31] M. D. Penrose. Random Geometric Graphs. Oxford University Press, Oxford, 2003. 

[32] A. G. Percus, G. Istrate, and C. Moore. Computational Complexity and Statistical 
Physics. Oxford University Press, New York, 2006. 

[33] B. Pittel. On tree census and the giant component in sparse random graphs. Random 
Struct. Algorithms, 1:311-342, 1990. 



24 



